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20 Methods for Genetic Diversification 

in Gene Conversion Active Cells 

The present invention relates to a method for directed and selective genetic diversification of a 
target nucleic acid sequence or gene product by exploiting the relationship between 
25 immunoglobulin gene conversion and hypermutation in antibody-producing cells, as well as to 
cells and cell lines capable of said genetic diversification. 

Many approaches to the generation of diversity in gene products rely on the generation of a very 
large number of mutants which are then selected using powerful selection technologies. However, 

30 these systems have a number of disadvantages. If the mutagenesis is done in vitro on gene 

constructs which are subsequently expressed in vitro or as transgenes in cells or animals, the gene 
expression in the physiological context is difficult and the mutant repertoire is fixed in time. If 
mutagenesis is on the other hand performed in living cells, it is difficult to direct mutations to a 
target nucleic acid where they are desired. Therefore the efficiency of isolating molecules with 

35 improved activity by repeated cycles of mutations and selection with sufficient efficiency is 
limited. Moreover, random mutagenesis in vivo is toxic and likely to induce a high level of 
undesirable secondary mutations. 



2 



In nature, directed diversification of a selected nucleic acid sequence takes place in the rearranged 
V(D)J segments of the immunoglobulin (Ig) gene loci. The primary repertoire of antibody 
specificities is generated by a process of DNA rearrangement involving the joining of 
immunoglobulin V, D, and J gene segments. Following antigen encounter, the rearranged V(D)J 
5 segments in those B cells, whose surface Ig can bind the antigen with low or moderate affinity, are 
subjected to a second wave of diversification by hypermutation. This so-called somatic 
hypermutation generates the secondary repertoire from which increased binding specificities are 
selected thereby allowing affinity maturation of the humoral immune response (Milstein and Rada, 
1995). 

10 

The mouse and man immunoglobulin loci contain large pools of V, D and J gene segments which 
can participate in the V(D)J rearrangement, so that significant diversity is created at this stage by 
random combination. Other species such as chicken, rabbit, cow, sheep and pig employ a different 
strategy to develop their primary Ig repertoire (Butler, 1998). After the rearrangement of a single 

15 functional V and J segment, further diversification of the chicken light chain gene occurs by gene 
conversion in a specialized lymphoid organ, the Bursa of Fabricius (Reynaud et al., 1987; 
Arakawa and Buerstedde, in press). During this process, stretches of sequences from non- 
functional pseudo-V-genes are transferred into the rearranged V-gene. The twenty-five pseudo-V- 
genes are situated upstream of the functional V-gene and share sequence homology with the V- 

20 gene. Similar to the situation in men and mice, affinity maturation after antigen encounter takes 
place by hypermutation in the splenic germinal centers of the chicken (Arakawa et al., 1996). 

All three B cell specific activities of Ig repertoire formation - gene conversion (Arakawa et al., 
2002), hypermutation and isotype switch recombination (Muramatsu et al., 2000; Revy et al., 

25 2000) - require expression of the Activation Induced Deaminase (AID) gene. Whereas it was 

initially proposed that AID is a DNA editing enzyme (Muramatsu et al., 1999), more recent studies 
indicate that AID directly modifies DNA by deamination of cytosine to uracil (Di Noia and 
Neuberger, 2002). However, the cytosine deamination activity must be further regulated, because 
only differences in the type, the location and the processing of the AID-induced DNA 

30 modification can explain the selective occurrence of recombination or hypermutation in different 
species and B cell environments. Based on the finding that certain AID mutations affect switch 
recombination, but not somatic hypermutation, it was suggested that AID needs the binding of a 
co-factor to start switch recombination (Ta et al., 2003; Barreto et al., 2003). 

35 Analysis of DT40 knock-out mutants indicates that the RAD54 gene (Bezzubova et al., 1997) and 
other members of the RAD52 recombination repair pathway are needed for efficient Ig gene 
conversion (Sale et al., 2001). Disruption of RAD51 analogues and paralogues reduces Ig gene 
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conversion and induces hypermutation in the rearranged light chain gene (Sale et al., 2001) 
suggesting that a defect in DNA repair by homologous recombination can shift Ig gene conversion 
to hypermutation. 

5 Recently, first cell systems have been developed which exploit the phenomenon of somatic t . 
hypermutation in the immunoglobulin locus to generate mutants of a target gene in constitutive 
and directed manner. These cell systems allow to prepare a gene product having a desired activity 
by cyclical steps of mutation generation and selection. Thus, WO 00/221 1 1 and WO 02/100998 
describe a human Burkitt lymphoma cell line (Ramos) which is capable of directed constitutive 
10 hypermutation of a specific nucleic acid region. This mutated region can be the endogenous 

rearranged V segment or an exogenous gene operatively linked to control sequences which direct 
hypermutation. A significant disadvantage of this cell system is that human cells cannot be 
efficiently genetically manipulated by targeted integration, since transfected constructs insert 
primarily at random chromosomal positions. 

15 

WO 02/100998 also describes another cell system for generating genetic diversity in the Ig locus 
which is based on the chicken B cell line DT40. DT40 continues gene conversion of the 
rearranged light chain immunoglobulin gene during cell culture (Buerstedde et al., 1990). 
Importantly, this cell line has a high ratio of targeted to random integration of transfected 

20 constructs thus allowing efficient genetic manipulation (Buerstedde and Takeda, 1991). According 
to WO 02/100998, deletion in DT40 of the paralogues of the RAD51 gene which are involved in 
homologous recombination and DNA repair led to a decrease in gene conversion and a 
simultaneous activation of hypermutation of the rearranged V segment. However, the main 
disadvantage of this system is that the mutant cells have a DNA repair deficiency as reflected by 

25 X-ray sensitivity and chromosomal instability. The mutants also have a low proliferation rate and a 
low gene targeting efficiency. Therefore this system is poorly suited for efficient gene 
diversification and selection. 

The present invention overcomes the disadvantages of the prior art systems and provides further 
30 advantages as well. 

SUMMARY OF THE INVENTION 

i 

In the first aspect of the invention there is provided a genetically modified lymphoid cell having 
35 gene conversion fully or partially replaced by hypermutation, wherein said cell has no deleterious 
mutations in genes encoding paralogues and analogues of the RAD5 1 gene which encode 
important homologous recombination factors. Specifically, the cell contains wild-type homologous 



4 



recombination factors. Due to the intact homologous recombination machinery, the cell according 
to the invention is recombination and repair proficient and has a normal proliferation rate. 

The cell of the invention is an immunoglobul in-expressing B lymphocyte derived from animal 
5 species which use the mechanism of gene conversion for developing their immunoglobulin 

repertoire. These species are for example chicken, sheep, cow, pig and rabbit. Preferably, the cell 
is derived from a chicken Bursal lymphoma. Most preferably, the cell is derived from or related to 
the DT40 cell line. 

10 In a further embodiment, the cell according to the invention is capable of directed and selective 
genetic diversification of a target nucleic acid by hypermutation or a combination of 
hypermutation and gene conversion. The target nucleic acid may encode a protein or possess a 
regulatory activity. Examples of proteins are an immunoglobulin chain, a selection marker, a 
DNA-binding protein, an enzyme, a receptor protein or a part thereof. In a preferred embodiment, 

15 the target nucleic acid is the V(D)J segment of a rearranged human immunoglobulin gene. 

Examples of regulatory nucleic acids are a transcription regulatory element or a RNAi sequence. 

In an embodiment, in which the target nucleic acid is diversified by a combination of 
hypermutation and gene conversion, the cell according to the invention contains at least one 
20 sequence capable of serving as a gene conversion donor for the target nucleic acid. 

In a further embodiment, the target nucleic acid is an exogenous nucleic acid operably linked to 
control nucleic acid sequences that direct genetic diversification. 

25 In an additional embodiment, the target nucleic acid is expressed in the cell according to the 
invention in a manner that facilitates selection of cells which exhibit a desired activity. The 
selection can be a direct selection for the activity of the target nucleic acid within the cell, on the 
cell surface or outside the cell. Alternatively, the selection can be an indirect selection for the 
activity of a reporter nucleic acid. 

30 

In a further embodiment, the invention provides for genetic means to modulate the genetic 
diversification of the target nucleic acid in the cell according to the invention. The modulation can 
be by modification of cis-acting regulatory sequences, by varying the number of gene conversion 
donors, or by modification of trans-acting regulatory factors such as activation-induced deaminase 
35 (AID) or a DNA repair or recombination factor other than a RAD5 1 analogue or paralogue. The 
cell preferably expresses activation-induced deaminase (AID) conditionally. 
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In a second aspect, there is provided a cell line derived from a cell according to the invention. In a 
preferred embodiment, the cell line is DT40 or a modification thereof. 

In a third aspect, there is provided a transgenic non-human animal containing a lymphoid cell 
5 having gene conversion fully or partially replaced by hypermutation, wherein said cell has no 
deleterious mutations in genes encoding paralogues and analogues of the RAD51 protein, and 
wherein said cell is capable of directed and selective genetic diversification of a transgenic target 
nucleic acid by hypermutation or a combination of hypermutation and gene conversion. In a 
preferred embodiment, the animal is chicken. 

10 

In a further aspect, the invention provides a method for preparing a cell capable of directed and 
selective genetic diversification of a target nucleic acid by hypermutation or a combination of 
hypermutation and gene conversion. The method comprises (a) transfecting a lymphoid cell 
capable of gene conversion with a genetic construct containing the target nucleic acid, and (b) 
15 identifying a cell having the endogenous V-gene segment of a part thereof replaced with the target 
nucleic acid. 

According to a further embodiment, the genetic construct containing the target nucleic acid further 
contains at least one nucleic acid capable of serving as a gene conversion donor for the target 
20 nucleic acid. The locus containing the target nucleic acid can be constructed by a single 

transfection or multiple rounds of transfection with constructs containing different components of 
the locus. 



In the embodiment, in which selection for a cell with a desired activity is indirect, the method of 
25 the invention further comprises (c) transfecting the cell from step (b) with a further genetic 
construct comprising a reporter gene capable of being influenced by the target nucleic acid. 

In a further embodiment, the method of the invention further comprises (d) conditional expression 
of a trans-acting regulatory factor. In a preferred embodiment, the trans-acting regulatory factor is 
30 activation-induced deaminase (AID). 

According to a particularly preferred embodiment, the target nucleic acid is inserted into the cell 
by targeted integration. 

35 In a further aspect, there is provided a method for preparing a gene product having a desired 

activity, comprising the steps of: (a) culturing cells according to the invention under appropriate 
conditions to express the target nucleic acid, (b) identifying a cell or cells within the population of 
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cells which expresses a mutated gene product having the desired activity; and (c) establishing one 
or more clonal populations of cells from the cell or cells identified in step (b), and selecting from 
said clonal populations a cell or cells which expresses a gene product having an improved desired 
activity. 

5 

In one embodiment, steps (b) and (c) are iteratively repeated until a gene product with an 
optimized desired activity is produced. 

According to a further embodiment, the genetic diversification can be switched off, for example, 
10 by down-regulation of the expression of a trans-acting regulatory factor, when the cell producing a 
gene product with an optimized desired activity has been identified. The trans-acting regulatory 
factor can be, for example, activation-induced deaminase (AID) or a factor involved in 
homologous recombination or DNA repair, other than a RAD51 paralogue or analogue. 

15 In another embodiment, the diversification of the target nucleic acid is further modified by target 
sequence optimization such as the introduction of Ig hypermutation hotspots or an increased GC 
content. 

In a further aspect of the present invention, there is provided the use of a cell capable of directed 
20 and selective genetic diversification of a target nucleic acid by hypermutation or a combination of 
hypermutation and gene conversion for the preparation of a gene product having a desired activity. 

DESCRIPTION OF THE FIGURES 

25 Fig. 1 \|/V gene deletion (A) A physical map of the chicken rearranged Ig light chain locus 

and the V|/V knock-out constructs. The locus contains a total of 25 \|/V genes upstream of 
functional V segment. The knock-out strategy of ij/V genes by the targeted integration of the 
p\|/VDell-25 and the pyVDel3-25 constructs is shown below. Only the relevant EcoRI sites are 
indicated. (B) Southern blot analysis of wild-type and knock-out clones using the probe shown in 

30 (A) after Eco RJ digestion. The wild-type locus hybridizes as a 12-kb fragment, whereas 

yypartki ^ hybridize as a 7.4-kb and 6.3-kb fragment, respectively. (C) AID status. The 

AID gene was amplified by PCR to verify the presence or absence of AID cDNA expression 
cassette. 

35 Fig. 2 slgM expression analysis of control and \j/V knock-out clones (A) FACS anti-IgM 

staining profiles of representative subclones derived from initially sIgM(+) clones. (B) Average 
percentages of events falling into slgM(-) gates based on the measurement of 24 subclones. 
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Fig. 3 Ig light chain sequence analysis of the Xj/V knock-out clones Mutation profiles of 
the AIDVV* (SEP ID NO: 1) and AIDVV 1 ^ (SEP ID NO: 2) clones. All nucleotide 
substitutions identified in different sequences in the region from the leader sequence to the J-C 
5 intron are mapped onto the rearranged light chain sequence present in the AID R precursor clone. 
Mutations of the AID%/V and the AID%/ V partial clones are shown above and below the reference 
sequence, respectively. Deletions, insertions and gene conversion events are also indicated. 
Hotspot motifs (RGYW and its complement WRCY) are highlighted by bold letters. 

10 Fig. 4 Mutation profiles of hypermutating cell lines (A) Percentages of sequences 

carrying a certain number of mutations. Each untemplated nucleotide substitution is counted, but 
gene conversion, deletions and insertions involving multiple nucleotides are counted as a single 
event. PM, point mutation; GC, gene conversion; D, deletion; I, insertion. (B) Pattern of nucleotide 
substitutions within sequences from vj/ V and the XRCC3 knock-out clones. Nucleotide 

15 substitutions as part of gene conversion events are excluded. The ratios of transition (trs) to 

transversion (trv) are also shown. (C) Hotspot preference of untemplated nucleotide substitution 
mutations. Mutations occurring within a hotspot motif (RGYW or its complement WRCY) are 
shown by percentages. (D) Trypan-blue positive cells as an indicator of spontaneously dying cells. 

20 Fig. 5 Distribution of nucleotide substitutions within genomic sequences from unsorted 

AIDVV" cells and within cDNA sequences from sorted IgM (-) AID^V" cells The number of 
mutations are counted for every.50 bp, and are shown together with the corresponding physical 
maps of the light chain genomic locus or the cDNA sequence. 

25 Fig. 6 A model explaining the regulation of Ig gene conversion and Ig hypermutation 

Fig. 7 In situ mutagenesis of the GFP gene (A) Ig VJ replacement vector. (B) in vivo 
mutagenesis of the GFP gene by hypermutation. (C) \|/V donor replacement vector. (D) in vivo 
mutagenesis of GFP gene by gene conversion and hypermutation. 

30 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention makes available a particularly useful cell system for directed and selective, 
genetic diversification of any nucleic acid by hypermutation or a combination of hypermutation 
35 and gene conversion. The system is based on B cell lines which constitutively diversify the 
rearranged immunoglobulin V-gene in vitro without requiring extracellular stimuli such as an 
interaction with other cells or molecules or maintenance of the B cell antigen receptor. 
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As used herein, "directed and selective diversification" refers to the ability of certain cells to cause 
alteration of the nucleic acid sequence of a specific region of endogenous or transgenic nucleic 
acid, whereby sequences outside of these regions are not subjected to mutation. 

5 

"Genetic diversification" refers to alteration of individual nucleotides or stretches of nucleotides in 
a nucleic acid. Genetic diversification in the cells according to the invention occurs by 
hypermutation, gene conversion or a combination of hypermutation and gene conversion. 

10 "Hypermutation" refers to the mutation of a nucleic acid in a cell at a rate above background. 

Preferably, hypermutation refers to a rate of mutation of between 10" 5 and 10* 3 bp* 1 generation l . 

This is greatly in excess of background mutation rates, which are of order of 10" 9 to 10" 10 

mutations bp* 1 generation" 1 (Drake et al. 1988) and of spontaneous mutations observed in PCR. 

Thirty cycles of amplification with Pfu polymerase would produce <0.05xl0" 3 mutations bp" 1 in 
15 the product, which in the present case would account for less than 1 in 100 of the observed 

mutations (Lundberg et ah, 1991). 

"Gene conversion" refers to a phenomenon in which sequence information is transferred in 
unidirectional manner from one homologous allele to the other. Gene conversion may be the result 
20 of a DN A polymerase switching templates and copying from a homologous sequence, or the result 
of mismatch repair (nucleotides being removed from one strand and replaced by repair synthesis 
using the other strand) after the formation of a heteroduplex. 

Hypermutation and gene conversion generate natural diversity within the immunoglobulin V(D)J 
25 segment of B cells. Hypermutation takes place in the germinal centers of such species as mouse 
and human following antigen stimulation. Gene conversion takes place in primary lymphoid 
organs like the Bursa of Fabricius or the gut-associated lymphoid tissue in such species as chicken, 
cow, rabbit, sheep and pig independent of antigen stimulation. In chicken, stretches from the 
upstream pseudo-V-genes are transferred into the rearranged V(D)J segment. According to the 
30 present invention, therefore, the cell or cell line is preferably an immunoglobulin-producing cell or 
cell line which is capable of diversifying its rearranged immunoglobulin genes. 

A direct connection between the initiation of hypermutation and gene conversion is for the first 
time established in the experiments reported herein. Specifically, partial or complete deletion of 
35 pseudo-V-genes in a cell line which continues gene conversion in cell culture leads to the 

activation of hypermutation in the immunoglobulin locus. Deletion of all pseudogenes results in 
the abolishment of gene conversion and simultaneous activation of high rates of hypermutation, 
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whereas deletion of a few pseudogenes results in the down-regulation of gene conversion and 
simultaneous activation of hypermutation at rates lower than the ones observed for the complete 
pseudogene deletion. Therefore, the number of available pseudogene donors directly correlates 
with gene conversion rates and inversely correlates with hypermutation rates. Gene conversion and 
5 hypermutation are established to be in a reciprocal relationship to each other. Thus, the present 
invention for the first time provides a cell system which allows to genetically diversify a target 
nucleic acid by a combination of hypermutation and gene conversion, whereby the contribution of 
these two phenomena can be regulated by changing the number of the gene conversion donors, 
their orientation or their degree or length of homology. 

10 

An advantage of the cell system according to the invention over a cell system with only 
hypermutating activity such as the one based on the human Burkitt lymphoma cell line Ramos 
(WO 00/221 1 1 and WO 02/100998) is the ability to combine genetic diversification by 
hypermutation and gene conversion in one cell. For example, more defined changes can be 

15 introduced into the target gene by gene conversion than by random hypermutation, since gene 
conversion donors can be engineered to contain sequences likely to influence the target nucleic 
acid activity in a favorable way. Gene conversion and hypermutation might thus increase the 
chance to produce desirable variants, since pre-tested sequence blocks are combined with random 
hypermutations. Pseudogenes with sequences identical to a certain region of the target gene can 

20 also be used to keep a part of the target nucleic acid stable by frequent conversions having the 
effect that the hypermutations persist only in the non-converting part. This approach is useful 
when the target nucleic acid contains region which should remain stable for optimal activity. 

An advantage of the cell system according to the invention over a cell system based on the 
25 suppression of homologous recombination activity in gene conversion active cells (WO 
02/100998) is genetic stability of the cell reflected in a normal proliferation rate, radiation 
resistance and DNA repair competence. 

A particular advantage of the present cell system over all known systems is the ability of the cells 
30 according to the invention to integrate transfected nucleic acid constructs by targeted integration 
into the homologous endogenous locus. 

"Targeted integration" is integration of a transfected nucleic acid construct comprising a nucleic 
acid sequence homologous to an endogenous nucleic acid sequence by homologous recombination 
35 into the endogenous locus. Targeted integration allows to directly insert any nucleic acid into a 
defined chromosomal position. In a preferred embodiment, a nucleic acid encoding a gene product 
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of interest is inserted by targeted integration into the immunoglobulin locus in place of the 
rearranged V(D)J segment or a portion thereof. 

In a preferred embodiment, the cells according to the invention are derived or related to cells 
5 which undergo Ig gene conversion in vivo. Cells which undergo Ig gene conversion in vivo are, for 
example, surface Ig expressing B cells in primary lymphoid organs such as avian Bursal B cells. 
Lymphoma cells, derived from B cells of primary lymphoid organs, are particularly good 
candidates for constructing cells and cell lines according to the present invention. In the most 
preferred embodiment, the cells are derived from a chicken Bursal lymphoma cell line DT40. 

10 

The process of constitutive genetic diversification by hypermutation and gene conversion is used 
in the present invention to produce gene products with a desired, novel or improved, activity. 

A "target nucleic acid" is a nucleic acid sequence or chromosomal region in the cell according to 
15 the present invention which is subjected to direct and selective genetic diversification. The target 
nucleic acid can be either endogenous or transgenic and may comprise one or more transcription 
units encoding gene products. 



As used herein, a "transgene" is a nucleic acid molecule which is inserted into a cell, such as by 
20 transfection or transduction. For example, a transgene may comprise a heterologous transcription 
unit which may be inserted into the genome of a cell at a desired location. . 

In one embodiment, transgenes are immunoglobulin V-genes as found in immunoglobulin- 
producing cells or fragments of V-genes. Preferably, the target nucleic acid is a human 
25 immunoglobulin V-gene. In this case, the cells according to the invention are "factories" of human 
antibody variants capable of binding to any given antigen. 

Alternatively, the target nucleic acid is a non-immunoglobulin nucleic acid, for example a gene 
encoding selection markers, DNA-binding proteins, enzymes or receptor proteins. For example, a 
30 novel fluorescent selection marker can be produced by mutating a known marker by 

hypermutation or by a combination of hypermutation and gene conversion with help of other 
known markers with a different fluorescent spectrum serving as gene conversion donors. 

In one embodiment of the invention, the target nucleic acid directly encodes a gene product of 
35 interest. Gene diversification of such a nucleic acid will result in a truncation of the encoded gene 
product or in a change of its primary sequence. With every round of diversification and selection, a 
cell expressing the gene product with an improved activity is search for. 
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Alternatively, the target nucleic acid is a regulatory element, for example, a transcription 
regulatory element such as promoter or enhancer, or interfering RNA (RNAi). In this embodiment, 
an additional nucleic acid (reporter gene) which is influenced by the target nucleic acid and 
5 encodes an identifiable gene product is required to identify cells bearing the target nucleic acid of 
interest. 

In the embodiment, in which genetic diversification of the target nucleic acid takes place by a 
combination of hypermutation and gene conversion, additional nucleic acids capable of serving as 
10 gene conversion donors are inserted into the cell genome, preferably upstream of the target nucleic 
acid. 

A "nucleic acids capable of serving as a gene conversion donor" is a nucleic acid having a 
sequence homologous to the target nucleic acid. Examples of natural gene conversion donors are 
15 pseudo-V-genes in the immunoglobulin locus of certain species. 

According to one embodiment of the invention, a cell capable of directed and selective 
diversification of the target nucleic acid is constructed by inserting the target nucleic acid into the 
host cell by targeted integration at a defined chromosomal site. For this purpose, the transfected 
20 constructs may contain upstream and downstream of the target nucleic acid sequences homologous 
to the desired chromosomal integration site. Preferably, the cell is constructed by replacing the 
endogenous V-gene or segments thereof with a transgene by homologous recombination, or by 
gene targeting, such that the transgene becomes a target for the gene conversion and/or 
hypermutation events. 

25 

In another embodiment, transgenes according to the invention also comprise sequences which 
direct hypermutation and/or gene conversion. Thus, an entire locus capable of expressing a gene 
product and directing hypermutation and gene conversion to this transcription unit is transferred 
into the cells and is actively diversified even after random chromosomal integration. 

30 

Screening of clones having incorporated the transgene by targeted integration can be done by 
Southern blot analysis or by PCR. 

In a preferred embodiment, transgenes according to the invention contain a selectable marker gene 
35 which allows selection of clones which have stablely integrated the transgene. This selectable 
marker gene may subsequently be removed by recombination or inactivated by other means. 
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The present invention further provides a method for preparing a gene product having a desired 
activity by repeated rounds of cell expansion and selection for cells bearing a target nucleic acid 
with a desired activity. As used herein, "selection" refers to the determination of the presence of 
sequence alterations in the target nucleic acid which result in a desired activity of the gene product 
5 encoded by the target nucleic acid or in a desired activity of the regulatory function of the target 
nucleic acid. 

The process of gene conversion and hypermutation is employed in vivo to generate improved or 
novel binding specificities in immunoglobulin molecules. Thus, by selecting cells according to the 
10 invention which produce immunoglobulins capable of binding to the desired antigen and then 
propagating these cells in order to allow the generation of further mutants, cells which express 
immunoglobulins having improved binding to the desired antigen may be isolated. 

A variety of selection procedures may be applied for the isolation of mutants having a desired 
15 specificity. These include Fluorescence Activated Cell Sorting (FACS), cell separation using 

magnetic particles, antigen chromatography methods and other cell separation techniques such as 
use of polystyrene beads. 

Fluorescence Activated Cell Sorting (FACS) can be used to isolate cells on the basis of their 

20 differing surface molecules, for example surface displayed immunoglobulins. Cells in the sample 
or population to be sorted are stained with specific fluorescent reagents which bind to the cell 
surface molecules. These reagents would be the antigen(s) of interest linked (either directly or 
indirectly) to fluorescent markers such as fluorescein, Texas Red, malachite green, green 
fluorescent protein (GFP), or any other fluorophore known to those skilled in the art. The cell 

25 population is then introduced into the vibrating flow chamber of the FACS machine. The cell 
stream passing out of the chamber is encased in a sheath of buffer fluid such as PBS (Phosphate 
Buffered Saline). The stream is illuminated by laser light and each cell is measured for 
fluorescence, indicating binding of the fluorescent labeled antigen. The vibration in the cell stream 
causes it to break up into droplets, which carry a small electrical charge. These droplets can be 

30 steered by electric deflection plates under computer control to collect different cell populations 
according to their affinity for the fluorescent labeled antigen. In this manner, cell populations 
which exhibit different affinities for the antigen(s) of interest can be easily separated from those 
cells which do not bind the antigen. FACS machines and reagents for use in FACS are widely 
1 available from sources world-wide such as Becton-Dickinson, or from service providers such as 

35 Arizona Research Laboratories (http://www.arl.arizona. e du/facs/) . 
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Another method which can be used to separate populations of cells according to the affinity of 
their cell surface protein(s) for a particular antigen is affinity chromatography. In this method, a 
suitable resin (for example CL-600 Sepharose, Pharmacia Inc.) is covalently linked to the 
appropriate antigen. This resin is packed into a column, and the mixed population of cells is passed 
5 over the column. After a suitable period of incubation (for example 20 minutes), unbound cells are 
washed away using (for example) PBS buffer. This leaves only that subset of cells expressing 
immunoglobulins which bound the antigen(s) of interest, and these cells are then eluted from the 
column using (for example) an excess of the antigen of interest, or by enzymatically or chemically 
cleaving the antigen from the resin. This may be done using a specific protease such as factor X, 
10 thrombin, or other specific protease known to those skilled in the art to cleave the antigen from the 
column via an appropriate cleavage site which has previously been incorporated into the antigen- 
resin complex. Alternatively, a non-specific protease, for example trypsin, may be employed to 
remove the antigen from the resin, thereby releasing that population of cells which exhibited 
affinity for the antigen of interest. 

15 

The present invention provided for the first time a mechanism which allows to regulate genetic 
diversification of the target nucleic acid. As demonstrated by the present inventors, activation- 
induced deaminase (AID) is a factor which regulates gene conversion as well as hypermutation in 
the immunoglobulin locus. It is suggested that AID induces a common modification in the 

20 rearranged V(D)J segment leading to a conversion tract in the presence of adjacent donor 

sequences and to a point mutation in their absence. Therefore, by regulation of AID expression, 
both phenomena can be modulated. In a preferred embodiment, the AID gene is transiently 
expressed in the cell containing a target nucleic acid. For example, AID can be expressed under a 
drug-responsive promoter such as the tetracycline responsible gene expression system. Otherwise 

25 the gene expression may be shut down by the excision of the AID expression cassette by induced 
recombination. Switching off the AID expression will prevent further diversification of the target 
sequence. Preferably, AID expression is switched off in the cell producing a gene product with a 
desired activity in order to prevent further mutations which can lead to the loss of the desired 
activity. 

30 

The invention is illustrated by the following examples. 



35 EXAMPLES 

1. AID initiates immunoglobulin gene conversion and hypermutation by a 



14 



common intermediate 



Herein it is reported that ablation of yV donors activates AID-dependent Ig hypermutation in 
chicken B ceil line DT40. This shows that Ig gene conversion and hypermutation are competing 
5 pathways derived from the same AID-initiated intermediate. Furthermore V|/V knock-out DT40 is 
proposed as an ideal model system to approach the molecular mechanism of Ig hypermutation and 
as a new tool for in situ mutagenesis. 

Methods 

10 

Cell lines. DT40 Crel which displays increased Ig gene conversion due to a v-myb transgene 
and contains a tamoxifen inducible Cre recombinase has been described previously (Arakawa et 
al., 2001). DT40 Crel AID" / ~ was generated by the targeted disruption of both AID alleles of DT40 Crel 
(Arakawa et al., 2002). AID R was derived from DT40 Cre, AID" A after stable integration of a floxed 

15 AID-IRES-GFP bicistronic cassette, in which both AID and GFP are expressed from the same p- 
actin promoter. AID R i|/V~ was derived from AID R by transfection of pyVDell-25 (Fig. 1A). Stable 
transfectants which had integrated the construct into the rearranged light chain locus were then 
identified by locus specific PCR. Targeted integration of pv|/VDell-25 results in the deletion of the 
entire vj/V gene loci starting 0.4 kb upstream of \|/V25 and ending one bp downstream of i|/Vl. 

20 AID%/V partiaI was produced in a similar way as AID R \j/V~ by transfection of p\j/VDel3-25 which 
upon targeted integration leads to a partial deletion of the \}/V loci starting 0.4 kb upstream of 
\j/V25 and ending one bp downstream of \|/V3. Cell culture and electroporation were performed as 
previously described (Arakawa et al., 2002). XRCC3"'" was derived from DT40 Crcl by deleting 
amino acids 72- 170 of XRCC3 gene following transfection of XRCC3 knock-out constructs. 

25 Clones having undergone targeted integration were initially identified by long-range PCR and the 
XRCC3 deletion was then confirmed by Southern blot analysis. 



Ig reversion assay. Subcloning, antibody staining, flow cytometry and quantification of 
slgM expression has been described previously (Arakawa et al., 2002). All clones used in the 
30 study were sIgM(+) due to the repair of the light chain frameshift of the original C118(-) variant 
(Buerstedde et al., 1990) by a gene conversion event. 



PCR. To minimize PCR-introduced artificial mutations, PfuUltra hotstart polymerase 
(Stratagene) was used for amplification prior to sequencing. Long-range PCR, RT-PCR and Ig 
35 light chain sequencing were performed as previously described (Arakawa et al., 2002). The 
promoter and J-C intron region of Ig light chain plasmid clones were sequenced using the M 13 
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forward and reverse primers. Bu-1 and EFla genes were amplified using BU1/BU2 (BUI, 
GGG AAGCTTGATCATTTCCTG AATGCTATATTCA (SEP ID NO: 13) : BU2, 
GGGTCTAGAAACTCCTAGGGGAAACTTTGCTGAG (SEP ID NO: 14)) and EF6/EF8 (EF6, 
GGGAAGCTTCGGAAGAAAG AAGCTAAAGACCATC (SEP ID NO: 15) : EF8, 
5 GGGGCTAGCAGAAGAGCGTGCTCACGGGTCTGCC (SEP ID NP: 16)) primer pairs, 
respectively. The PCR products of these genes were cloned into the pBluescript plasmid vector, 
and were sequenced using the M 1 3 reverse primer. 

Results 

10 

Targeted deletion of \j/V donor sequences in the rearranged light chain locus 

Two \|/V knock-out constructs were made by cloning genomic sequences, which flank the 
intended deletion end points, upstream and downstream of a floxed-gpt (guanine phosphoribosyl 
transferase) cassette (Arakawa et a!., 2001). Upon targeted integration, the first construct, 

15 p\|/VDell-25, deletes all pseudogenes (v|/V25 to V|/V1) whereas the second construct, p\|/VDel3-25, 
deletes most pseudogenes (yV25 to v|/V3) (Fig. 1 A). A surface IgM positive (sIgM(+)) clone, 
derived from DT40 Crcl AID" 7 ' cells (Arakawa et al., 2002) by transfection and stable integration of a 
floxed AID-IRES (internal ribosome entry site) -GFP transgene, was chosen for the transfection of 
the v|/V knock-out constructs. This AID reconstituted clone, named AID R , has the advantage that 

20 the appearance of deleterious Ig light chain mutations can be easily detected by the loss of slgM 
expression and that GFP-marked AID expression can be shut down after tamoxifen induction of 
the Cre recombinase transgene inherited from DT40 Crel (Arakawa et al., 2002). 

Following transfection of the v|/V knock-out constructs into the AID R clone, mycophenolic 
acid resistant clones containing targeted deletions of the rearranged light chain locus were 

25 identified. These primary \|/V knock-out clones contain two floxed transgenes, the inserted gpt 
marker gene in the rearranged light chain locus and the AID-IRES-GFP gene of the AID R 
progenitor clone. Since the gpt gene might perturb the adjacent transcription or chromatin 
configuration, the primary \j/V knock-outs were exposed to a low concentration of tamoxifen and 
then subcloned by limited dilution. In this way, secondary \|/V knock-out clones could be isolated 

30 which had either deleted only the gpt gene (AID R yV" and AIDVv 1 ^) or the gpt gene together 
with the AID-IRES-GFP gene (AID~ A \|/V~and AID^m/V^). The disruption of i|/V genes in the 
rearranged light chain locus and the excision of AID over-expression cassette were confirmed by 
Southern blot analysis (Fig. IB) and PCR (Fig. 1C), respectively. 

35 Increased loss of slgM expression after deletion of \j/V genes in AID positive clones 
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To estimate the rates of deleterious Ig mutations, slg expression was measured by FACS 
after two weeks culture for 24 subclones each of the DT40 Crel , AID R , DT40 Cre, AID' / *and V|/V 
knock-out clones (Figs. 2 A and 2B). Analysis of the controls with the intact vj/V locus revealed an 
average of 0.52% and 2.27% slgM(-) cells for the DT40 CreI and AID R subclones respectively, but 
5 only 0.08% for the DT40 Crel AID" / \ Previous analysis of spontaneously arising slgM(-) DT40 
variants demonstrated that about a third contained frameshift mutations in the rearranged light 
chain V segment which were regarded as byproducts of the Ig gene conversion activity 
(Buerstedde et al., 1990). This view is now supported by the finding that the AID negative 
DT40 Crel AID" 7 " clone, which should have lost the Ig gene conversion activity, stably remains 
10 sIgM(+). Most interestingly, subclones of the AID positive yV knock-out clones (AID R v|/V partial 
and AID R \|/V) rapidly accumulate slgM(-) populations whereas subclones of the AID negative V|/V 
knock-out clones (AID^n/V^* 1 and AID"'"\|/V) remain sIgM(+) (Figs. 2A and 2B). This suggests 
that the deletion of the pseudogenes dramatically increases the rate of deleterious light chain 
mutations in AID expressing cells. 

15 

Replacement of Ig gene conversion by hypermutation in the absence of i|/V donors 

To analyze the newly identified mutation activity, the rearranged light chain VJ segments of 
the V|/V knock-out clones were sequenced 5-6 weeks after subcloning. A total of 135 nucleotide 
changes (Fig. 4A, Table 1) were found in the 0.5 kb region between the V leader and the 5' end of 

20 the J-C intron within 95 sequences from the AID R \j/V clone (Fig. 3, above reference sequence). In 
contrast to the conversion tracts seen in wild-type DT40 cells, almost all changes are single base 
substitutions and apart from a few short deletions and di-nucleotide changes, mutation clusters 
were not observed. The lack of conversion events in AID R \|/V\ which still contains the V|/V genes 
of the unrearranged light chain locus, confirms that Ig gene conversion only recruits the v|/V genes 

25 on the same chromosome for the diversification of the rearranged light chain gene (Carlson et al., 
1990). No sequence diversity was found in a collection of 95 light chain gene sequences from the 
AID^Vj/V" clone (Fig. 4A, Table 1), indicating that AID is required for the mutation activity. 

Sequences derived from the AID^V^ 31 clone occasionally display stretches of mutations 
which can be accounted for by the remaining vj/Vl and i|/V2 (Fig. 3, below reference sequence). 

30 Nevertheless, the majority of AID R \|/V partial mutations are single untemplated base substitutions as 
seen with the AID R v|/V" cells (Fig. 4A, Table 1). Only 3 base substitutions, which possibly are PCR 
artifacts, were found in 92 sequences of the AlD^'yV 9 * 1 ** 1 clone confirming that both the gene 
conversion and the mutation activities of AID^Y 153 ** 31 are AID dependent. 



35 



The new mutation activity of the \|fV knock-out clones closely resembles somatic 
hypermutation 
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The discovered Ig mutation activity in the \|/V knock-out clones with a predominance of 
single nucleotide substitutions suggests that somatic hypermutation had replaced Ig gene 
conversion. There is however a difference between the nucleotide substitutions in the AID R v|/V partial 
and AID R v|/V" clones and Ig hypermutations in germinal center B cells in that the clones show very 
5 few mutations in A/T bases and a preference for trans version mutations (Fig. 4B). 

Ig hypermutations are typically localized within one kb of the transcribed gene sequence 
with preferences for the Complementary Determining Regions (CDRs) of the V(D)J segments, 
whereas no or few mutations are present in the downstream C region (Lebecque and Gearhart, 
1990). To investigate whether the mutations in the AID R \j/V" clone follow a similar distribution, 

10 sequence analysis was extended to the promoter region and the J-C intron of the rearranged light 
chain gene (Fig. 5). Although mutations are found close to the promoter and in the intron 
downstream of the J segments, the peak incidence clearly coincides with the CDR1 and CDR3, 
which are also preferred sites of gene conversion in DT40 (unpublished results). Approximately 
half of all point mutations fall within the RGYW (R « A/G; Y = C/T; W = A/T) sequence motif or 

15 its complement WRCY (Fig. 4C), known as hot spots of Ig hypermutation in humans and mice. 

It was previously reported that the deletion of RAD51 paralogues induces Ig hypermutation 
in DT40 (Sale et al., 2001). To compare the hypermutation activity in the \j/V gene negative and 
RAD51 paralogue negative backgrounds, the XRCC3 gene was disrupted in the DT40 Crel clone 

20 and the rearranged VJ genes were sequenced 6 weeks after subcloning. Similar to the mutation 

spectrum in the AID R \j/V" clone and what was previously reported (Sale et al., 2001), the mutations 
in the sequences from the XRCC3" /_ cells show a transversion preference and an absence of 
mutations in A/T bases (Fig. 4B). Nevertheless the mutation rate in the XRCC3 mutant was about 
2.5 fold lower than in the AID*Vv~ clone and there was a clear slow growth phenotype of the 

25 XRCC3 mutant compared to wild-type DT40 and the AID R i|AT clone (Fig. 4D). 

To identify the mutations responsible for the loss of slgM expression in the AID R \|/V clone, 
94 light chain cDNAs from sorted slgM(-) cells were amplified and sequenced. Although one 
short insertion and five deletions were detected in this collection (Table 1), 89% of the 245 total 
30 mutations are single nucleotide substitutions within the VJ segments (Fig. 5). Surprisingly, only 
about 10% of the sequences contained a stop codon or a frameshift, suggesting that the lack of 
slgM(-) expression is mainly caused by amino acid substitutions which affect the pairing of the Ig 
light and heavy chain proteins. 
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Ig locus specificity of hypermutation 

It has been reported that high AID expression in fibroblasts (Yoshikawa et al., 2002) and B 
cell hybridomas (Martin and ScharfT, 2002) leads to frequent mutations in transfected transgenes. 
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To rule out that the pseudogene deletions had induced a global hypermutator phenotype, the 5' 
ends of the genes encoding the B cell specific marker Bu-1 and the translation elongation factor 
EFlcc were sequenced for the AID R \j/V" clone. Only a single one bp deletion was found within 95 
sequences of the Bu-1 gene and only two single nucleotide substitutions within 89 sequences of 
5 EFla (Table 1). As these changes most likely represent PCR artifacts, this further supports the 
view that the hypermutations induced by the \\i V deletions are Ig locus specific. 

Discussion 

The results demonstrate that the deletion of the nearby pseudogene donors abolishes Ig gene 
10 conversion in DT40 and activates a mutation activity which closely resembles Ig hypermutation. 
The features shared between the new activity and somatic hypermutation include 1) AID 
dependence, 2) a predominance of single nucleotide substitutions, 3) distribution of the mutations 
within the 5' transcribed region, 4) a preference for hotspots and 5) Ig gene specificity. The only 
difference with regard to Ig hypermutation in vivo is the relative lack of mutations in A/T bases 
15 and a predominance of transversion mutations in the \|/V knock-out clones. However, this 

difference is also seen in hypermutating EBV transformed B cell lines (Bachl and Wabl, 1996; 
Faili et al., 2002) and DT40 mutants of RAD5 1-paralogues (Sale et al., 200 1) indicating that part 
of the Ig hypermutator activity is missing in transformed B cell lines. Interestingly, the rate of Ig 
hypermutation in the AID R v|/V" clone seems higher than the rate of Ig gene conversion in the 
20 DT40 Crel progenitor. An explanation for this could be that some conversion tracts are limited to 
stretches of identical donor and target sequences and thus leave no trace. 

The induction of Ig hypermutation by the blockage of Ig gene conversions supports a simple 
model explaining how hypermutation and recombination is initiated and regulated (Fig. 6). At the 

25 top of the events is a modification of the rearranged V(D)J segment which is either directly or 

indirectly induced by AID. The default processing of this lesion in the absence of nearby donors or 
in the absence of high homologous recombination activity leads to Ig hypermutation in form of a 
single nucleotide substitution (Fig. 6, right side). However, if donor sequences are available, 
processing of the AID induced lesion can be divided into a stage before strand exchange, when a 

30 shift to Ig hypermutation is still possible and a stage after strand exchange when the commitment 
toward Ig gene conversion has been made (Fig. 6, left side). Whereas completion of the first stage 
requires the participation of the RAD51 paralogues, the second stage involves other recombination 
factors like the RAD54 protein. 

35 This difference in commitment explains why disruptions of the RAD51 paralogues not only 

decrease Ig gene conversion, but also induce Ig hypermutation (Sale et al., 2001) whereas 
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disruption of the RAD54 gene only decreases Ig gene conversion (Bezzubova et al., 1997). The 
model also predicts that low cellular homologous recombination activity prevents Ig gene 
conversion even in the presence of conversion donors. Such a low homologous recombination 
activity might be the reason why human and murine B cells never use Ig gene conversion despite 
5 the presence of nearby candidate donors in form of unrearranged V segments and why chicken 
germinal center B cells shave shifted from Ig gene conversion to Ig hypermutation (Arakawa et al., 
1998). 

The AID R and the vj/V knock-out DT40 clones are a powerful experimental system to 
10 address the role of trans-acting factors and cis-acting regulatory sequences for Ig gene conversion 
and hypermutation. Compared to alternative animal or cell culture systems it offers the advantages 
of: 1) parallel analysis of Ig gene conversion and Ig hypermutation, 2) conditional AID expression, 
3) easy genome modifications by gene targeting, 4) normal cell proliferation and repair proficiency 
and 5) Ig locus specificity of hypermutation. The ability to induce gene specific hypermutation in 
15 the DT40 cell line might also find applications in biotechnology. One possibility is to replace the 
chicken antibody coding regions by their human counterparts and then to simulate antibody 
affinity maturation from a repertoire which continuously evolves by Ig hypermutation. 

2. Targeted in vivo mutagenesis of GFP by gene conversion and 
20 hypermutation 

The gene encoding Green Fluorescent Protein (GFP) is an example of a target nucleic acid which 
can be genetically diversified using the cell system of the invention, in particular the DT40 cell 
line. The GFP gene inserted into the Ig light chain locus by targeted integration will be subjected 
25 to hypermutation and its activity with respect to color, intensity and half-life will evolve with time 
(Fig. 7B). If a combination of hypermutation and gene conversion is used to modify the GFP 
activities, variant GFP sequences which can serve as gene conversion donors for GFP are also 
inserted into the Ig locus (Fig. 7D). 

An Ig VJ replacement vector, pVJRepBsr, which allows to replace the Ig light chain VJ gene by 
30 any nucleic acid target is depicted in Fig. 7A. A potential target for mutagenesis can be cloned into 
Spel site, which is compatible with Xbal, Nhel, Avrll and Spel sites. For example, the GFP gene 
can be inserted into the Ig light chain locus by targeted integration using pVJRepBsr. A Vj/V-gene 
donor replacement vector, pPseudoRepBsr, which allows to replace the Ig v|/V gene light chain 
locus by any nucleic acid target is depicted in Fig. 7C. Potential gene conversion donors can be 
35 cloned into either Nhel or Spel site, which is compatible with Xbal, Nhel, Avrll and Spel sites. 
Because Nhel site is located between two loxPs, this site can be used for conditional knockout 
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design. By stepwise targeted integration using pPseudoRepGpt and pVjRepBsr, V|/V genes can be 
replaced by vyGFP gene and its variants (e.g. i|/CFP: cyano fluorescence protein and v|/YFP: yellow 
fluorescence protein) and the VJ gene can be replaced by GFP carrying a frameshift mutation 
(FsGFP) to monitor genetic diversification of the GFP gene. The frameshift in FsGFP is expected 
5 to be repaired by gene conversion of \|/GFP, ij/CFP and yYFP as templates. In addition, the gene 
will be further diversified by hypermutation. 
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1 .A genetically modified lymphoid cell having gene conversion fully or partially replaced by 
hypermutation, wherein said cell has no deleterious mutations in genes encoding paralogues 
and analogues of the RAD5 1 protein. 
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2 .The cell according to claim 1, wherein the cell contains wild-type homologous recombination 
activity. 

3. The cell according to claim 1 or 2, wherein the cell has an unaffected proliferation rate. 

4. The cell according to any one of claims 1 to 3, wherein the cell is DNA repair proficient. 

5 5. The cell according to any one of claims 1 to 4, wherein the cell is an immunoglobulin-expressing 
B cell. 

6. The cell according to any one of claims 1 to 5, wherein the cell is derived from chicken, sheep, 

cow, pig or rabbit. 

7. The cell according to any one of claims 1 to 6, wherein the cell is a chicken Bursal lymphoma 
10 cell. 

8. The cell according to any one of claims 1 to 7, wherein the cell is a DT40 cell or a derivative 

thereof. 

9. The cell according to any one of claims 1 to 8, wherein the cell expresses activation-induced 

deaminase (AID). 

15 lO.The cell according to any one of claims 1 to 9, wherein the cell is capable of directed and 

selective genetic diversification of a target nucleic acid by hypermutation or a combination of 
hypermutation and gene conversion. 

1 1 .The cell according to claim 10, wherein the target nucleic acid encodes a protein or exercises a 
regulatory activity. 

20 12.The cell according to claim 1 1, wherein the target nucleic acid encodes an immunoglobulin 
chain, a selection marker, a DNA-binding protein, an enzyme, a receptor protein, or a part 
thereof. 

13.The cell according to claim 12, wherein the target nucleic acid is a human immunoglobulin V- 
gene or a part thereof. 

25 14.The cell according to claim 1 1, wherein the target nucleic acid contains a transcription 
regulatory element or an RNAi sequence. 

15. The cell according to any one of claims 10 to 14, wherein the cell further contains at least one 
sequence capable of serving as a gene conversion donor for the target nucleic acid. 
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16. The cell according to any one of claims 10 to 15, wherein the target nucleic acid is integrated 

into the chromosome at a defined location by targeted integration. 

17. The cell according to any one of claims 1 1 to 16, wherein the target nucleic acid is operably 

linked to control nucleic acid sequences that direct genetic diversification. 

5 18. The cell according to any one of claims 1 1 to 17, wherein the cell expresses the target nucleic 
acid in a manner that facilitates selection of cells comprising mutants of said nucleic acid 
having a desired activity. 

19.The cell according to claim 18, wherein the selection is a direct selection for the activity of the 
target nucleic acid within the cell, on the cell surface or outside the cell. 

10 20.The cell according to claim 18, wherein the selection is an indirect selection for the activity of a 
reporter nucleic acid. 

21. The cell according to any one of claims 10 to 20, wherein genetic diversification of the target 

nucleic acid by gene conversion and hypermutation is modulated by genetic manipulation. 

22. The cell according to claim 21, wherein the modulation is by cis-acting regulatory sequences. 

15 23. The cell according to claim 20 or 22, wherein the modulation is by varying the number, the 
orientation, the length or the degree of homology of the gene conversion donors. 

24 . The cell according to any one of claims 20 to 23, wherein the modulation is by a trans-acting 

regulatory factor. 

25. The cell according to claim 24, wherein the trans-acting regulatory factor is activation-induced 
20 deaminase (AID). 

26. The cell according to claim 24, wherein the trans-acting factor is a DNA repair or 

recombination factor other than a RAD5 1 paralogue or analogue. 

27. A cell line derived from the cell of any one of claims 1 to 26. 

28. A non-human transgenic animal containing a lymphoid cell having gene conversion fully or 
25 partially replaced by hypermutation, wherein said cell has no deleterious mutations in genes 

encoding paralogies and analogues of the RAD51 protein, and wherein said cell is capable of 
directed and selective genetic diversification of a transgenic target nucleic acid by 
hypermutation or a combination of hypermutation and gene conversion. 
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29. A method for preparing a cell capable of directed and selective genetic diversification of a 

target nucleic acid by hypermutation or a combination of hypermutation and gene conversion 
comprising (a) transfecting a lymphoid cell capable of gene conversion with a genetic 
construct containing the target nucleic acid, and (b) identifying a cell having the endogenous 
5 V-gene or a fragment thereof replaced with the target nucleic acid. 

30. The method according to claim 29, wherein the genetic construct containing the target nucleic 

acid further contains at least one nucleic acid capable of serving as a gene conversion donor 
for the target nucleic acid. 

3 l.The method according to claim 29 or 30, wherein the locus containing the target nucleic acid is 
10 constructed by multiple rounds of transfection. 

32. The method according to any one of claims 29 to 3 1 further comprising (c) transfecting the cell 

from step (b) with a further genetic construct comprising a reporter gene capable of being 
influenced by the target nucleic acid. 

33. The method according to any one of claims 29 to 32 further comprising (d) conditional 
1 5 expression of a trans-acting regulatory factor. 

34. The method of claim 33, wherein the trans-acting regulatory factor is activation-induced 

deaminase (AID). 

35. The method according to any one of claims 29 to 34, wherein the target nucleic acid is inserted 

into the cell chromosome at a particular location by targeted integration. 

20 36.A method for preparing a gene product having a desired activity, comprising the steps of: (a) 

culturing cells according to any one of claims 1 1 to 26 under appropriate conditions to express 
the target nucleic acid, (b) identifying a cell or cells within the population of cells which 
expresses a mutated gene product having the desired activity; and (c) establishing one or more 
clonal populations of cells from the cell or cells identified in step (b), and selecting from said 

25 clonal populations a cell or cells which express(es) a gene product having an improved 

desired activity. 

37. The method according to claim 36, wherein steps (b) and (c) are iteratively repeated. 

38. The method according to claims 36 or 37 further comprising the step of switching off genetic 

diversification. 
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39. The method according to any one of claims 36 to 38, wherein the diversification of the target 

nucleic acid is further modified by target sequence optimization. 

40. The method according to any one of claims 36 to 39, wherein the genetic diversification is 

switched off by down-regulation of the expression of a trans-acting regulatory factor. 

41. The method according to claim 40, wherein the trans-acting regulatory factor is activation- 

induced deaminase (AID). 

42. Use of the cell according to any one of claims 10 to 26 or a cell line according to claim 27 for 

the preparation of a gene product having a desired activity. 
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SUMMARY 

The invention relates to a modified lymphoid cell having gene conversion fully or partially 
5 replaced by hypermutation, wherein said cell has no deleterious mutations in genes encoding 
paralogues and analogues of the RAD5 1 protein, and wherein said cell is capable of directed and 
selective genetic diversification of a target nucleic acid by hypermutation or a combination of 
hypermutation and gene conversion. The invention also relates to a method for diversifying any 
transgenic target gene in said cell. Preferably, the target gene is integrated into the 
10 immunoglobulin light or heavy chain locus by targeted integration. 
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<140> 10/590,211 
<141> 2006-08-22 

<150> PCT/EP05/01897 
<151> 2005-02-23 
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<151> 2004-02-23 

<160> 16 

<170> Patentin ver. 3.3 

<210> 1 
<211> 476 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial sequence: Synthetic 
polynucleotide 

<220> 

<223> see specification as filed for detailed description of 
substitutions and preferred embodiments 

<400> 1 

tccctggtgc aggcagcgct gactcagccg gcctcggtgt cagcaaaccc aggagaaacc 60 
gtcaagatca cctgctccgg gggtggcagc tatgctggaa gttactatta tggctggtac 120 
cagcagaagt ctcctggcag tgcccctgtc actgtgatct atgacaacga caagagaccc 180 
tcggacatcc cttcacgatt ctccggttcc aaatccggct ccacagccac attaaccatc 240 
actggggtcc gagccgatga cgaggctgtc tatttctgtg ggagctacga agacaacagt 300 
ggtgctgcat ttggggccgg gacaaccctg accgtcctag gtgagtcgct gacctcgtct 360 
cggtctttct tcccccatcg tgaaattgtg acattttgtc gatttttggt gatttggggg 420 
tttttcttgg acttggcggc aggctggggt ctgccaccgg cgcagggccg ggcact 476 



<210> 2 
<211> 476 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
polynucleotide 

<220> 

<223> see specification as filed for detailed description of 
substitutions and preferred embodiments 



<400> 2 

tccctggtgc aggcagcgct gactcagccg gcctcggtgt cagcaaaccc aggagaaacc 60 
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gtcaagatca 
cagcagaagt 
tcggacatcc 
actggggtcc 
ggtgctgcat 
cggtctttct 
tttttcttgg 



cctgctccgg 
ctcctggcag 
cttcacgatt 
gagccgatga 
ttggggccgg 
tcccccatcg 
acttggcggc 



gggtggcagc 
tgcccctgtc 
ctccggttcc 
cgaggctgtc 
gacaaccctg 
tgaaattgtg 
aggctggggt 



2102702 
tatgttggaa 
actgtgatct 
aaatccggct 
tatttctgtg 
accgtcctag 
acattttgtc 
ctgccaccgg 
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gttactatta 
atgacaacga 
ccacagccac 
ggagcactgc 
gtgagtcgct 
gatttttggt 
cgcagggccg 



tggctggtac 120 
caagagaccc 180 
attaaccatc 240 
agacaacagt 300 
gacctcgtct 360 
gatttggggg 420 
ggcact 476 



<210> 3 
<211> 12 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 3 
ggtagcggct at 



Synthetic 



12 



<210> 4 
<211> 58 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 4 

aacctgggag gaaccgtcga gatcacctgc tccgggggtt acagcggcta tgcctgga 



58 



<210> 5 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 5 

tggaagttac tattatggct ggtacc 



26 



<210> 6 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 6 

tataatggca ataacaga 



18 



<210> 7 
<211> 33 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 7 

atctactatg atgatgagag accctcgaac ate 



<210> 8 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 8 

ggttctggat ccggctccac aaacaca 



<210> 9 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 9 
gggtacgaag ac 



<210> 10 
<211> 15 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 10 

gctatctatt actgt 



<210> 11 
<211> 15 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 11 

gggagctgea tagac 
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<210> 12 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 12 

tattactgtg ggtgcataga c 



<210> 13 
<211> 34. 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
primer 

<400> 13 

gggaagcttg atcatttcct gaatgctata ttca 



<210> 14 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
primer 

<400> 14 

gggtctagaa actcctaggg gaaactttgc tgag 



<210> 15 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
primer 

<400> 15 

gggaagcttc ggaagaaaga agctaaagac catc 



<210> 16 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
primer 

<400> 16 

ggggctagca gaagagcgtg ctcacgggtc tgcc 
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